Discriminative Reordering Model Adaptation via Structural Learning
نویسندگان
چکیده
Reordering model adaptation remains a big challenge in statistical machine translation because reordering patterns of translation units often vary dramatically from one domain to another. In this paper, we propose a novel adaptive discriminative reordering model (DRM) based on structural learning, which can capture correspondences among reordering features from two different domains. Exploiting both in-domain and out-of-domain monolingual corpora, our model learns a shared feature representation for cross-domain phrase reordering. Incorporating features of this representation, the DRM trained on out-of-domain corpus generalizes better to in-domain data. Experiment results on the NIST Chinese-English translation task show that our approach significantly outperforms a variety of baselines.
منابع مشابه
A Discriminative Syntactic Model for Source Permutation via Tree Transduction
A major challenge in statistical machine translation is mitigating the word order differences between source and target strings. While reordering and lexical translation choices are often conducted in tandem, source string permutation prior to translation is attractive for studying reordering using hierarchical and syntactic structure. This work contributes an approach for learning source strin...
متن کاملLatent Structure Discriminative Learning for Natural Language Processing
Natural language is rich with layers of implicit structure, and previous research has shown that we can take advantage of this structure to make more accurate models. Most attempts to utilize forms of implicit natural language structure for natural language processing tasks have assumed a pre-defined structural analysis before training the task-specific model. However, rather than fixing the la...
متن کاملInducing a Discriminative Parser to Optimize Machine Translation Reordering
This paper proposes a method for learning a discriminative parser for machine translation reordering using only aligned parallel text. This is done by treating the parser’s derivation tree as a latent variable in a model that is trained to maximize reordering accuracy. We demonstrate that efficient large-margin training is possible by showing that two measures of reordering accuracy can be fact...
متن کاملDiscriminative Reordering Extensions for Hierarchical Phrase-Based Machine Translation
In this paper, we propose novel extensions of hierarchical phrase-based systems with a discriminative lexicalized reordering model. We compare different feature sets for the discriminative reordering model and investigate combinations with three types of non-lexicalized reordering rules which are added to the hierarchical grammar in order to allow for more reordering flexibility during decoding...
متن کاملDiscriminative Reordering Model for Machine Translation
We have built a discriminative reordering model for the phrase-based machine translation system Moses, which is developed at the University of Edinburgh. The model is a maximum entropy classifier which incorporates a variety of feature functions to predict phrase orientation for machine translation. Two kinds of features reported in literature, namely lexical features and dependency path featur...
متن کامل